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APPROXIMATE POLYNOMIAL STRUCTURE 
IN ADDITIVELY LARGE SETS 

MAURO DI NASSO, ISAAC GOLDBRING, RENLING JIN, STEVEN LETH, MARTINO LUPINI, 

KARL MAHLBURG 


Abstract. We show that any subset of the natural numbers with positive logarithmic 
Banach density contains a set that is within a factor of two of a geometric progression, 
improving the bound on a previous result of the authors. Density conditions on 
subsets of the natural numbers that imply the existence of approximate powers of 
arithmetic progressions are developed and explored. 


1. Introduction 

In [I], the authors introduced a measure space, obtained by taking a quotient of a 
Loeb measure space, that has the property that multiplication is measure-preserving and 
for which standard sets of positive logarithmic density have positive measure. The log 
Banach density of a standard set (see Section 2 below for the definition) was also intro¬ 
duced, and this measure space framework was used, in conjunction with Furstenberg’s 
Recurrence Theorem, to obtain a standard result about the existence of approximate 
geometric progressions in sets of positive log Banach density. In this paper, we im¬ 
prove the bounds of approximation of this result by using Szemeredi’s Theorem together 
with a “logarithmic change of coordinates.” More specifically, in Proposition 3.1, we 
show that if A is a standard subset of the natural numbers, then the Banach density of 
{: x € A} is greater than or equal to the log Banach density of A. This allows 
us to use Szemeredi’s Theorem to show that every set of positive Banach log density 
contains a set which is “within a factor of 2” of being a geometric sequence; Theorem 
3.3 provides a precise version of this statement. We also explore a family of densities 
on the natural numbers, the (upper) r-Banach densities for 0 < r < 1, which have the 
property that positive 1/m-Banach density implies the existence of approximate mth 
powers of arithmetic progressions, in a sense made precise in Theorem 3.7. (This fam¬ 
ily of densities was introduced in [T], although BD m (A) in that paper corresponds to 
BD i /m {A) here.) 

In Section 2 we establish some properties of the log Banach density and the r-Banach 
densities, most notably that the log Banach density of a set A is always less than or 
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equal to every r-Banach density of A, and that if r < s then the ?’-Banach density of A 
is less than or equal to the s-Banach density of A (Theorem 2.12). These inequalities 
can both be strict. In fact, it is easy to see that the log Banach density of a set A 
could be 0 while every ?’-Banach density of A is 1, and in Example 2.13 we see that if 
r < s then it is possible to have the r-Banach density of a set A be 0 while the s-Banach 
density of A is 1. 

In Section 3 we establish the aforementioned approximation results and provide ex¬ 
amples to show that the level of approximation is optimal. 

We use nonstandard methods, which simplifies a number of the arguments. For an 
introduction to nonstandard methods aimed specifically toward applications to combi¬ 
natorial number theory, see [3]. 

1.1. Acknowledgements. This work was initiated during a week-long meeting at the 
American Institute for Mathematics on August 4-8, 2014 as part of the SQuaRE (Struc¬ 
tured Quartet Research Ensemble) project “Nonstandard Methods in Number Theory.” 
The authors would like to thank the Institute for the opportunity and for the Institute’s 
hospitality during their stay. 


2. r-DENSITY AND LOGARITHMIC DENSITY 


Convention 2.1. In this paper, N denotes the set of positive natural numbers. For 
any real numbers a < b, we set [a,b] := {x € N : a < x < 6}. We make a similar 
convention for the intervals (a, 6], [a, b), and (a, 6). 

We recall some well-known densities on N. 


Definition 2.2. Suppose that A C N and 0 < r < 1. 

• The upper r-density of A is defined to be 

d r {A) := limsup-^ V — 

n—K>o Tl . , nr 

x£An[l,n 

• The lower r-density of A is defined to be 


1 


d r (A) := lirninf V 

oo n r * nrA- r 


n->-oo Ti' z ' X v 
#EAn[l,n] 


Note that d\(A) and d 1 (A) are simply the usual upper and lower asymptotic densities 
of A, respectively. For that reason, we omit the subscript r when r = 1. 


Definition 2.3. Suppose that A C N. Then: 

• The upper logarithmic density of A is defined to be 


ld(A) ■.= limsup -- 

n— kx) In ti 


E 


i£yln[i,n] 


1 

X 
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• The lower logarithmic density of A is defined to be 

IdiA) := liminf —. 

n^oo In n x 

i£j 4 (l[l,n] 

The following result establishing relationships amongst the above densities was proven 

in Pi- 

Fact 2.4. For iCN and 0 < r < s < 1, we have 

d s (A) < d r (A) < ld(A) < Td(A) < d r {A) < d s (A). 


In working with these densities, we often use the following elementary estimates 
(established using an integral approximation): for any a < b in N, we have 

^ 1 b r — a r A _J_ 

<^1 7 * X 1 V 

x=a x=a +1 

Theorem 2.5. For 4CN and 0 < r < 1, we have 

dr(A) > 1 - (1 -d{A)) r . 


Proof. Set a := d(A) and take H € *N \ N such that ~ a. where N := |*An [1,R]|. 
Set e := -tt — ct, so e is a (possibly negative) infinitesimal. We now have 


d r (A) > st 


(s. E _L) 

\ H r ^ x l ~ r 
\ xg *An[i,J?] / 


> 


> st 


xG ( H-N,H] 

r H r - (H - N) r 
IF r 


= St (l-(l-(a + e) n 

= 1 — (1 — a) r . □ 


Corollary 2.6. If d(A) > 0, then d r (A) > 0 for all 0 < r < 1. 

Remark 2.7. It is easy to construct a set A C N with d(A) = 1 and ld(A) = 0. As a 
consequence of the theorem above, we also have d r (A) = 1 for any 0 < r < 1. 

We now introduce the corresponding uniform versions of the above densities. 


Definition 2.8. For iCN and 0 < r < 1, the (upper) r-Banach density of A is defined 
to be 


BD r (A) 


TeN'i , r ,„ ,, 

x£Ar\[k,(k r -\-n) 1 / r 
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Note that BD(H) = BDi(H) is the usual upper Banach density of A. Note also that 
d r (A ) < BD r (T) from definition. 

Definition 2.9. For ACN, the (upper) log Banach density of A is 

£BD(T) := lim sup — 

n->oo k>l Inn 

Of course one could also define the lower r-Banach density and the lower log Banach 
density, but in this paper we only focus on the upper r-Banach density and upper log 
Banach density. 

The following nonstandard formulation of r-Banach density and log Banach density 
follows immediately from the nonstandard characterization of limit. 



EAn[k,nk\ 


Proposition 2.10. Let A C N, 0 < r < 1, and 0 < a < 1. 

(1) BD r (H) > a if and only if there are k,N E *N with N > N such that 

1 


st 


r 

N 


E 


A—r | — 


> a. 


x£An[k,(k r +N) 1 / r ] 

(2) LBD(H) > a if and only if there are k,N € *N with N > N such that 


st 


— ]T 

In N ^ x I 
k xe*An[k,Nk] ) 


> a. 


We now establish the uniform version of Fact 12.41 above. The results in [2j do not 
immediately apply in the uniform setting. Nevertheless, our proof is inspired by the 
arguments from [2], although we argue in the nonstandard model to make the idea more 
transparent. 


Theorem 2.11. For any ACN and 0 < r < s < 1, we have 

FSD(A) < BD r (T) < BD S (A). 


Proof. We first prove that BD r (T) < BD S (^4). 

Let 0 < a < 1 be such that BD S (^4) < a. ft suffices to show that /3 := 
By Proposition 12.101 we can find a, b € *N such that b r — a r > N and 


BD r (T) = st 



BD r (^4) < a. 


Here, xa denotes the characteristic function of (the nonstandard extension of) A. Next 
note that if c, d E *N are such that d s — c s > N, then Proposition 12.101 once again implies 
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Choose m £ [a, b] such that m r — a r > N and 

/ rn \ / b 


2 n l-r E n l 


0. 


(For example, let m = \((b r — a r ) 1//2 + a r ) 1 ^ r ~\.) Since x x s — x r is an increasing 
function, we have that — a s > m r — a r . Hence > 1 - < a } — ; — for any 

/ ^ 70 F S / 70 F S 


i > m. Now we have 


y~^ XA(n) _ y-* Xa( 'u) 1 

/ ^1 T / ^ 7-7 1 “ 


n=a 

b 


n x-s n s-r 


77/ 


n=a i=n 

b i 

EE 

i=a n=a 
6 i 


< a EE n i 



( 1 

1 

-s 1 / ^ 
\i=n 

\i s r 

(i + l) s “ 

XA(n) ( 

1 

1 

n 1_s \ 

i s ~ r 


XA{n) ( 

1 

1 

n l ~ s V 

is-r 

(i + l) s - r . 

- 1 ( 

' 1 

1 


+ 


(b + l) 6 


XA(n) 1 


n=a 

b 


—s V a: s—r 


(i + l) s 


n l ~ s 

(6 + l) s_r 

XA{n) 

1 

n 1 ~ s 

(6 + l) s-r 

b 


■Z 1 

1 


^ n 1_s (6 + l) 6 


+ a-«)EE^r 


—s V vs— r 


(f + i) s 


= a 


Y—y( _L 

n l s \ is 


n=a i=n 

m 


(i + iy 

+ < 1 -“)EtV7E(7V7- 


V 1 1 

+ 0 n i-S (5 + 1 ), 


= a 


Y — 

Z—/ 70 I 


n 1 

n=a i=ra 

1 1 


(i + l) s 


—s 70 S—r 


n 1 - * n 


in ^ 1 ra .. 

+ (! - «) E -(!-«) E Eu 


1 


^ n 1 s (m + l) s 


b 1 m 1 

n—a n=a 

( m 1 \ / b 1 

We conclude that (3 < a since E^ Ew 


-1 


0. 


In the arguments above, if we let r = 0 and instead require that ln(m) — ln(a) > N, 
then we get t'BD(H) < BD S (H). □ 


It is easy to see that the set A = 2n!] has the property that tBD(H) = 0 

while BD r (yl) = 1 for every r € (0,1]. The following example shows that the r-Banach 
densities can also disagree to this extent. 
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Example 2.12. For any 0 < r < s < 1 there is a set A C N such that BD r (A) = 0 and 
BD s (j4) = 1. 

Proof. Let ( a n ) be a sequence of positive integers defined by setting a\ to be any integer 
larger than 1 and a n+ \ := af . Let 

o° r i / 

.4 = [J W‘\ (<" + l) 

n= 1 - 

We show that BD r (yl) = 0 and BD S (^4) = 1. 

Suppose that k, N G *N with N > N are such that 

BD A4) = st(A £ -P 

\ xG*An[k,(k r +N) 1 / r ] 

Let v be the maximal m £ *N such that 

[ a m’ S 1 ( a m S + l) 1/r n [k, {k r + N) llr ] / 0. 

Note then that 

*A n [k, ( k r + AT) 1 /-] C [A;, + l) 1/r ] U [a 1 / 3 , ( a l J s + l) 1 /*]. 

The latter interval is negligble: 




= st (^ 

= 0 . 

Next observe that < 2(^J) 1 / rs < 2 y/(k r + N)V r ). If 2^(k r + iV) 1 /^ < 

/c. then the above computation shows that BD r ( J 4) = 0. Thus, we may assume that 
‘1\J(k r + N ) 1 / r ) > k, from which it is readily verified that N > k r . It follows that 


BD r (yl) < st 

= st 

< st 
= 0 . 
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For showing BD s (yl) = 1, it suffices to show that y (all* + 1 j J — aj r > N when 

n > N. Indeed, if N G *N \ N is such that ^a« s + 1 j ^ — al!, r > N, then *A 
contains the interval 

[«n /rS ,((«n AS ) S + ^) 1/S ]- 

Note that 

+ -al/ r 

= (a l r! s + l) (a l J s + l) ( V - a l J r 
> {a]! s + l) at r),{rs) ~a^ 

1 | s — r s — r 1 s — r 

= dn rs + - ah = . 

s—r 

It remains to observe that a n rs > N because a n > N and (s — r)/(rs ) is a positive 
standard real number. □ 

3. Polynomial structure and multiplicative structure 
In what follows, log denotes log 2 . For iCN, set 

log A := {|~logaf| : x G A}. 

We also introduce some convenient notation: for k, N G *N and E C *N, set 
Lk,N(E) = Y2x£Er\[k,Nk] V x ' 

Proposition 3.1. If A C N, we have BD(logyl) > PBD(A). 

Proof. Without loss of generality, we can assume that £BD(Al) = a > 0. Take k, N G *N 
with N > N so that st(Lk : N(*A)) = a. We first claim that we can assume that k and 
kN are integer powers of 2. Indeed, choose integers a,b such that 2 a_1 < k ^ 2 a and 
2 b ^ kN < 2 h+l . Note that h — a > N. Observe now that 

2°—1 Nk 

Y Y 

x=k x=2 b +l 

SO 

L kA* A ) ~^ I] V*- 

xe*An[2 a ,2 b ] 

It remains now to notice that ln(2 ft_1 ) < In N < ln(2 b_a ) + In 2, whence 

-—— i/x ~ -— t— i/x. 

In N In 2 b a 

x€* An[2 a ,2 b ] xe*An[2 a ,2 b ] 

In light of the previous paragraph, we may take a < b in *N so that st(L 2 ° 2 b ~ a (*A)) = 
a. For a < i < b, set I* := [2* + 1,2* +1 ]. Observe that |~log(x)] = i + 1 for all x G I;. 
Set X := {i : lid* A / 0}. We then have: 
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|log(M)n(o,6]| = |X| 

= ^log(2 i+1 )-log(2 i 


i&X 


> log(e) Y Y 

i&X xG*An[2 i ,2 i+1 ) 

Recalling that ln(2 6-a ) = Y{e) - ^ f°U° ws that 

E 


BD(logM)) > l 1 °g(M) n (a,i]| > )' - = a. 

K ” ~ b — a ~ ln2 b " a ^ x 

a;e*An[2“,2 6 -l) 


□ 


We now come to the central notion of this paper. 

Definition 3.2. Fix c, r £ R >0 . 

(1) For a, x € M, we say that a is a (c, r)-approximation of x if a € [x,x + cx r ). 

(2) For A, X Cl, we say that A is a (c,r)-approximate subset of X if every a € A 
is an (c, r)-approximation of some x € X. 

Theorem 3.3. Suppose that A C N is such that ^BD(j4) > 0. Then for any l € N, 
there exist arbitrarily large a, d € N such that the geometric sequence G := {2 a (2 d ) n : 
n = 0,1,— 1} is a (1, 1)-approximate subset of A. 

Proof. Fix m € N. Since BD(log74) > TRD(A) > 0, the set log A contains an arithmetic 
progression {a + nd : n = 0,1,..., l — 1} with a,d > m. Fix n E {0, 1, ...,£ — 1} and 
take x € A and 6 € [0,1) such that a + nd = log x + 9. Then x < 2 a+nd = 2 e x < 2x. □ 

The following example shows that we cannot improve upon the level of approximation 
in the previous theorem. 

Example 3.4. For each e > 0, there is A C N such that ld(A ) = ld(A) > 0 and no 
positive integer power of 2 is a (1 — e, 1)-approximation of any element of A. 

Proof. Choose <5 > 0 such that (2 — e)2 s < 2. Set 

OO 

A:= \J[2 n + l,2 n+5 }- 

n =1 

Note that the interval [2 n+<5 , (2 — e)2 n+<5 ] does not contain any positive integer power of 
2 as 2 n+1 ^ (2 — e)2 n+s implies that 2 < (2 — e)2 5 . It follows that no power of 2 is a 
(1 — e, 1) approximation of any element of A. We leave it to the reader to show that 
ld(A) = ld(A) >6. □ 

Our next example shows that one cannot prove Theorem 13.31 under the weaker as¬ 
sumption of positive Banach density. 
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Example 3.5. Let a < 1. Fix a j such that (j — 1 )/j > a. Let uq = 2, Ui + \ > (jui) 3 , 
and set 

OO 

A = 1J [ui,jui\. 

i= 1 

Then d(A) > a. For any n E N, there exists an m £ N such that there does not exist 
3-term geometric progression G = {o, ar, ar 2 } with a,r > m and G is an [n — 1,1)- 
approximate subset of A. 

For a proof of the claim in the previous example, one can consult our paper [lj. 

For 1CN and 0 < r < 1, set 

A r :={|V1 :x£A}. 

One proves the following proposition in a manner similar to the proof of Proposition 

HO 


Proposition 3.6. For any 4CN, we have BD(H r ) > BD r (H). 


Theorem 3.7. Suppose A C N and m E N are such that BD i/ m (A) > 0. Then for 
any e > 0 and l € N, there exist arbitrarily large a, d € N such that {(a + nd) m : n = 
0.1,..., / — 1} is an (m + e, approximate subset of A. 


Proof. Fix p € N. Since BD(A 1 / m ) > BD m (Al) > 0, there are a,d > p such that 
{a + nd : n = 0,1,...,/ — 1} C A l / m . Choose a sufficiently large so that, for any 
z > a — 1, we have 


ez m ~ l > ( m ) z m - 2 + ( m \x m - 3 + --- + mx + 1. 

\m — 2) \rn — 3) 

Fix n E {0,1,...,/ — 1} and take x E A and 8 € [0,1) such that a + nd = x 1//m + 8. 
Since x 1//m > a — 1, we have 


x ^ (a + nd) m = (x"* + 8) rn = x + (m + e)x ™ 8 

m \ m — 2 „ ( m \ m —3 


i m —1 

+8 —ex ™ + 


m — 2 


m—2 

X rn 8 + 


m — 3 


X m 


+ ■■■ + 8 


m— 1 


/ x rn-± 

< X + [m + C)X rn . 


Hence (a + nd) m is an (rn + e, m ^-) -approximation of x E A. 


□ 


The next example shows that there is not much room left to improve upon the level 
of approximation in the previous theorem. 


Example 3.8. For any e > 0, there exists a 5 > 0 and there exists a set iCN such 
that di/ m (A) = di/ m (A) = 6 and such that, for any a € N, a m is not an (m — e, m ^-)- 
approximation of any element in A. 
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oo 

Proof. Fix 0 < 5 < e/m and set A := |^J [n m + 1, (n + 5) m ). Suppose, towards a 

n —1 

contradiction, that a m € [x,x + (m — for some x € [ n m + 1, (n + <5) m ). It 

follows that n + 1 < a, whence 

((n + 5) + ( 1 - <5)) m = (n + l) m ^ (n + <5) m + (m - e)(n + tf)™- 1 . 

Hence m(n + <5) m_1 (l — 5) ^ (m — e)(n + which implies that dm > e, a contra¬ 
diction. We leave it to the reader to check that d 1(/m (H) = dy m (A) =6. □ 

Corollary 12.61 and Theorem 13.71 immediately imply: 

Corollary 3.9. Suppose that A C N is such that d(A) > 0. Then for any l,m £ N and 
e > 0, there exists arbitrarily large a, d € N such that {(a + nd) m : n = 0,1,... , 1} is an 
(m + e, m ^-) -approximate subset of A. 

We should remark that the conclusions of approximate structure really are necessary. 
For example, if A is the set of all square-free numbers, then d(A) > 0 but A does not 
contain any 3-term geometric progression or any m-th power of an integer greater than 
1 with m > 2. 
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